ISCAS in English-Chinese CLIR at NTCIR-5
نویسندگان
چکیده
We participated in the Chinese single language information retrieval(SLIR) C-C task and EnglishChinese cross-language information retrieval(CLIR) E-C tasks in NTCIR5. Our project concentrates on the two aspects of the CLIR research: 1) We test various IR models especially language models for Chinese SLIR using the training corpus provided by the NTCIR organizer, and different smoothing methods have been studied for Chinese SLIR; 2) Our C-E CLIR task is based on the dictionary-based translation approach, and a new context-based translation algorithm using web corpus is proposed to solve the outof-vocabulary(OOV) problem in CLIR.
منابع مشابه
ISCAS in CLIR at NTCIR-6: Experiments with MT and PRF
We participated in the English-Chinese cross-language information retrieval (CLIR) E-C tasks in NTCIR6. Considering the special feature of crossing two different languages in CLIR, our main concerns in our experiment are 1) to evaluate the appropriateness of MT as a means of query translation in CLIR, 2) to evaluate the effect of feedback in retrieval model to the performance of CLIR which has ...
متن کاملISCAS in CLIR at NTCIR-6
We participated in the English-Chinese cross-language information retrieval (CLIR) E-C tasks in NTCIR6. Considering the special feature of crossing two different languages in CLIR, our main concerns in our experiment are 1) to evaluate the appropriateness of MT as a means of query translation in CLIR, 2) to evaluate the effect of feedback in retrieval model to the performance of CLIR which has ...
متن کاملUsing the Web for Translation Disambiguation: RMIT University at NTCIR-5 Chinese-English CLIR
RMIT University participated in the Chinese– English NTCIR-5 CLIR task. In previous work, researchers have relied on the test collection corpus to perform translation disambiguation in CLIR. In a production system, one would not be able to use a constrained test collection for disambiguation. Therefore we are interested to see how well our techniques perform when the web is used to provide cont...
متن کاملOverview of CLIR Task at the Fifth NTCIR Workshop
The purpose of this paper is to overview research efforts at the NTCIR-5 CLIR task, which is a project of large-scale retrieval experiments on cross-lingual information retrieval (CLIR) of Chinese, Japanese, Korean, and English. The project has three sub-tasks, multi-lingual IR (MLIR), bilingual IR (BLIR), and single language IR (SLIR), in which many research groups from over ten countries are ...
متن کاملJustsystem-Clairvoyance CLIR Experiments at NTCIR-4 Workshop
At the NTCIR-4 workshop, Justsystem Corporation and Clairvoyance Corporation collaborated in participating in the Cross-Language Retrieval Task (CLIR). We submitted results to the sub-tracks of SLIR and BLIR. For the SLIR track, we submitted Chinese, English, and Japanese monolingual runs. For the BLIR track, we submitted Japanese-English and Chinese-English runs. The major goal of our particip...
متن کامل